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(57) Abstract; A system and melhods fnr 
capturing, storing and retrieving cusiomcr 
faco-tn-facc frontal inleracdnns characterizing 
walk-in environments, for the purpose of quality 
managcmeaL The system compriies interaction 
capcme and storage unit, which includes at 
least one of screen capture, storage and retrieval 
component or, voice capture, storage and 
retrieval component or, video capture, storage 
and leirieva] component. The system conq»ising - 
a set of recording and information gelhering 
technics suitable for Walk-in eovironmeDts 
that win unable organizations to lecoid retrieve 
and evahiaiB the fiontal interactions with their 
customers. 
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RECORDING AND QUAUnTY MANAGEMENT SOLUTIONS 
FOR WALK-DSrENVmONMBmS 

RELATED APPLICATIONS 

ITie present invention rsJates and claims priority from US provisional patent 
^pUcation serial number 60/317,150 titled QUALIIY MANAGEMENT AND 
KBCXfROmG SOLUTIONS FOR WALK-IN C3BNTERS, filed 6 September 2001. 

The present invention relates to PCT patent plication serial number 
PCT/ILO2/O0a97 titled A METHOD FOR CAPTURING, ANALYZING AND 
RECORDING THE CUSTOMER SERVICE REPRESENTATIVE ACnVITlES 
filed 12 March, 2002, and to PCT patent qjpEcation serial number PCT/IL02/00796 
titled SYSTEM AHJD MEIHOD FOR CAPTURING BROWSER SESSIONS AND 
USER ACTIONS filed 24 August 2001, and to US patent application serial number 
10/056,049 titled VIDEO AND AUDIO CONTENT ANALYSIS SYSTEM filed 30 
January 2001, and to US provisional patent application serial nvunber 60/354,209 
titled ALARM SYSTEM BASED ON VIDEO ANALYSIS filed 6 February 2002, 
and to PCT patent qjplication serial number PCT/IL02/00593 titled METHOD, 
APPARATUS AND SYSTEM FOR CAPTURING AND . ANALYZING 
INTERACTION BASED CONTENT filed 18 July 2002, the content of wMdi is 
bBcsby incorporated by reference. 

BACKGROUND OF THE INVENTION • 
FIELD OF THE INVENTION 

The presort ioveation relates to capturing, stcning and retrieval of 
synchnmized voice, screen and video interactions, in general and to methods for 
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triggering of recording, to Customer Experience Management (CEh*) and 
interactions capturing for quality management (QM) purposes, in particular. 

DISCUSSION OF RELATED ART 

A major portion of the interaction between a modem business and its 
customeis are conducted via flie CaU Center or Contact Center. These somewhat 
overl^ping terms relate to a business unit which manages and m nin t niT i s 
interactions willi the business' customers and prospects, whefher via means of 
phone in the case of liie CaU Center and/or through con5)Uter-based media such 
as e-mail, web cha^ collaborative browsing, shared whiteboards. Voice over IP 
(VOff), etc. These electronic media have transformed 4e Call Center into a 
Contact Center handling not only traditional phone calls, bat also a complete 
multimedia contacts. 

Digital voice, data and sonjedmes scieen recording is common practice in 
CaU Cent^ and Contact Centers as well as in tradfaig floors and in bank 
branches. Such recording abiUties are ^ically used fat compUance purposes, 
when such recording of Ihe intwactions is required by law or other means of 
regulation, risk management, limiting the businesses* legal eaposure due to false 
allegations regarding the content of ttie interaction or for quality assurance using 
tije re-creation of the iotBraction to evaluate an agent's padBMmance. 

• 

Current systems aie focused on recordiBg phone calk such as Voice, VODP 
and computer based mteractions with customers such as e-mails, chat sessions, 
ooUaborative browsing and the like, but are failing to address the recordmg of the 
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most common interactions, the ones done in walk-in environments where the 
customer has a firontal, fece-to-face, interaction with the company representative. 
This solution is referring to any kind of frontal, tace to fece point of sale or service 
from service centers through branch banks> fast food coimters and the like. 

5 

As mentioned earlier there is no current sobtion for recording, for quality 
management purposes and for content related business analytics, of the most 
common interactions - the ones done in a walk-in environment such as walk-in 
centers, branch banks, stores aod maay other private, commercial or government 

10 points of presence, where a person has a frontal interaction with an agent This is 
refeiring to any kind of service-providing center. Non-limiting examples are service 
centers, fast food counters, check-in counter, any Over The Counter (OTC) fece-to- 
&oe provided services and the like. Defining an agent to be assy professional 
representative of a business or government providing a sCTvice to a customer or 

15 civilian. Non limiting examples would include: a derk in a store, a banlcer, a tax 
authority representative servicing representatives at IRS ofiSces, a ground agent 
checking a passenger in for a flig^ and the like. 

The problem that the currsnt known in the art solutions are &ced with is a 
20 conceptual one as well as a technological one. The basis for a recording of an 
interaction includes an identified beginning and end. Phone call, email handling and 
web collaboration sessions all have a defined beginning and end that can be 
identified easily. Fuifiiemore, most technological loggmg platforms enabte tke 
capturing of interactions and thus are able to provide additional information about 
2S the int^otion. In frontal center there are no means of reporting of begioning and 
end of interactioos, nor the ability to gain additional information about tite 

3 
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interaction that would enable one to associate this "additioiial information" to it and 
to act on it In referring to "additional infonnation*' we refer to information such as 
who the customer (or civilian) is, how long he or she has been waiting in line to be 
served, what service flie customer intended to discuss when reaching the agent and 
5 so on and so forth. Informatron like &is is readily^ available and common^ used in 
recording phone calls and can be obtained by CH (Computer Telephony Integration) 
information or CDR/SMDR (C^ Detail Reporting/Station Message Details 
Recording} connectivity. For email and other media this has been achieved by 
integrating the enabling platform, using a proprietary protocol of some sort with the 
10 recording platfonn. By virtue, the waDc-in environment's characteristic is of people 
seeking service that come and leave acc(»rding to tiie queue and there is no enablmg 
platfonn for &e communication. 

Anottier problem is how to record sudi intmctions since there is no line of 
15 communication between both sides. Additional aspect of Ihe problem is the fact &at 
the interaction in a walk^in environment has a visual aspect, which does not typically 
exist in remote communications discussed above. Hie visual^ face-to-&ce interaction 
between agents and customeis (or civilians) is irnportant in this environment and 
therefore should be recorded too. 

20 

The present solution deals wi&'fhe described problems 1^ solving the 
obstacles presented, providing a mefbod for &ce-to-face recordings storiug and 
retrieval, organization will be able to provide abilities as to ^orce quality 
management exercise business analytic techniques and as direct consequence 
25 enhance quality of services m its remote branches. 



4 
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The person skifled in the art will ^jpredate that there is Iherefore a need for a simple 
new and novel method for capturing axid analyzing Walk-in, face-to-face interaction 
for quaiity manag^ent puiposes, 

5 

SUMMARY OF IHEPRESENTINVENHON 

It is an object of present invention to provide a novel method and system 
for capturing^ logging and retrieval of &ce-to*'&c6 (fiontal) interactions for lite 

JO purpose of further analysis, by- overcoming known technological obstacles 
characterizing the commonly known ^'Walk-in^ environm^ts. 

Bi accordance wxdi the present invoation, there is thus provided a system for 
captuiing lace-to-face int^actioa comprising uatemction capturing and storage unit, 
microphmies (wired or wireless) devices located near the parties interacting and 

15 optionally one (or more) video camera. The system interaction c^ture and storage 
unit finther comprises of at least a voice capture, storage and retrieval component 
and optionally a screen capture and storage cornponent for screen shot and screen 
• events interaction cq>turing, storing and retrieval, video capture and storage 
component for capturing, storing and retrieval of lite visual streaming video 

20 faiteraction. In addition a database component in which mfonnation regarding the 
mteraction is stored &r later analysis is required, non- limiting example is interaction 
in&rmation to be evaluated fay team leaders and supervisors. The database holds 
additional metadata related to the interaction and any information gathered fiom 
external source, non-limiting example is infonnation gathered from a 3"^ party such 

25 as from Customer Relationship Management (CRM) plication, Queue 
Management System, Work Force Management Application and tiie like. The 
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database component can be an SQL database with drivers used to gather this data 
fiom smrounding databases and components and insert ftds data into the database. 

In accordance wifli the present invention a variation system would be a ^stem 
5 in which the capture and storage elements are separated and interconnected over a 
LANAVAN or any other IP based network. In such an implementation the capture 
component is located at &e location at which Ihe interaction takes place. Hie storage 
component can either be located at ttie same location or be centralized at another 
location covering multiple walk-in environments (branches). The transfer of contsmt 
io (voice, screen or other media) from the capture component to the storage component 
can eifiier be based on proprietary protocols such as but not luniting to a unique 
packaging of RTP packets fertile voice or based on standard protocols such as IL323 
for VoIP. 

15 In accordance with the presaat invention, tiierc is also provided a method for 

collecting or generating information m a CTI less or CDR feed less '^walk-in*^ 
environment for separating the media stream into interactions representing 
independent customs interactions and for generating additional data known as 
metadata desrabing tiie call Hie metadata typically, provides additional data to 

M describe tiie mteractions entry in tiie database of recorded interactions enabling fast 
location of a specific interaction and to derive recording decisions and flawing of 
interactions based on tixis data (non limiting example is a random or rule based 
selwstion of interaction to be recorded or flagged for the purpose of quality 
manag^ent)* 



6 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention disclosed a new methods and system for capturing^ 
storing, retrieving face^o-face interactions for the purpose of qualify management en 
5 Walk-in environment 

The proposed solution is a set of recording and iafonnation gethering 
technics, creating a system solution for Walk-in Environments fiiat will enable 
organi2adons to record retrieve and evaluate the frontel interactions with flieir 
to customs. Such &ce-to^&ce interactions might be interactions that customers 
experience on a daily bases saoh as in fast food counters, banking, point of sale and 
the like as well as diose int^actions that are 1ko more complicated to handle, cases 
were customers like in the case of service centers come physically to Hn^ supplier of 
the service once they ^gave up" on otiier means of communication. 

15 

The present invention will be understood and ^predated from &q following 
detailed ^cxiption taken in conjunction with the drawing of figure 1. In figure 1 a 
typical high-level diagram solution &r walk-in centers is shown. The system I 
describes a process flow, starting from the &ce-to-iac6 interaction between parties 

20 and ending in an a]>plication that benefits fiom all the recorded, processed and 
aoafyzed information. The agent 10 and the customs II are representing the pardes 
engaged in flie iateractian 21, interaction 21 is candidate for iurther capture and 
evahiation. biteraction 21 in Ifae context of the present embodiment is any stream of 
information exchanged between the parties during &ce-to-face communication 

25 session whether voice oqytured by microphones, computer information captured by 
screen shots fit>m the agraf s woricstation or visual gestures captured by video finom 
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cameras. Hie system includes interactioa capture and storage unit 15 which includes 
at least one voice capture and storage component 18 for voice mtemction capturing, 
storing and retrieval as a non-limiting example NiceLog by NICE Systrais Ltd* of 
R'annana, Israel, and optionally one or more screen capture and storage components 
5 17 for screen shot and screen events interaction c^turing, storing and retrieval such 
as a non limiting sample NiceScreen by NICE Systems Ltd. of Raanana, Israel, one 
or more video capture and storage- component 20 for capturing, storing and retrieval 
of the visual streaming video interaction coming from one, or more, video camem 
13, a non-limiting example such as NiceVision by NICE Systems Ltd., and a 

10 database component 19 in which information regarding the interaction is stored for 
later query and analysis as non limiting example NicCLS by NICB Systems Ltd. of 
Raanana, Israel. A variant or alternative solution for the purpose of biandi recording 
is where the capture and storage elements are separated and interconnected over a 
LANAVAN or any other IP based local or wide or other networic In such an 

15 implementation the capture component is located at the location at which the 
intemcdon takes place. The storage component, which includes the database 
component 19, can either be located at the same location or be centralized at another 
location covering multiple walkrin envhonments or branches. The transfer of content 
voice, screen or other, media fiom the capture componeDt to the storage component . 

20 can either be based on proprietaiy protocols such as a unique packaging of RTF 
packets for the voice or based on standard protocols such as H.323 for VoIP and the 
hke. 

hi Older to capture ffae voice, two 12 omni-directional microphones are 
25 installed directed at bo& side of fte interaction, agent 1 0, customer 1 1. Altematefy, a 
single bi-diiectional microphone may be used. Once captured voic^ screen and 

8 
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video recordings are stored in an Interaction capture & storage unit 15^ the 
information is stored in a database 19 and may ei&er be recreated for purposes such 
as dispute resolution or be further evaluated by team leaders and supervisors 16 
using for example by the NiceUniverse application suite by NICE Systems Ltd. of 
5 Raanana, Israel. ITie suggested solution enables capturing of the intieraction with 
microphones 12 and video cameras 13 located in tiie walk-in service center. It should 
be noted that the video 20, voice 18 and the screen 17 capture & storage components 
are synchronized by continuonsfy synchronizing their clocks using aay time 
synchronization method for wBinph by using as a non limiting sample the NTP - 
10 Network Tune Protocol or IRIGKB. 

One of the major challenges in a walk-in face-to-fice interaction environment 
is the lack of the CII or CDK feed. This is limititig not ooly since it is needed to 
separate the stream inlo interactions representing independent customs interactions 

IS but also since the data describing the call is required for other uses. This data, 
referred to as metadata can indude the agents name or specifio ID» the customer 
name or speoifio ID, an accomt number, the department or service the int^ction is 
related to, various flags such as to indicate if a txai^action was completed or if the 
case has be^ closed in' addition to the beginning and rod time of the interaction. 

20 Ibis is iho type of information one usual^ receives from the CTI link in telephony 
centric mtoraction but is not available in this environment due to the &ct ftiat an 
interaction-enabling platform^ sudi as tekphoiQ^ switch^ is not required. 

The metadata is typically used for three uses: firstly to determine the 
25 beginning and end of the interaction, secondly to provide additional data to describe 
the interactions entry in the database of recorded interactions for enabling &st 
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location of a specific interaction and thirdly, to drive recording decisions and 
flagging of interactions based on this data. 

Here are solutions offered to overcome these lluce obstacles, regarding the 
detennination of beginning and end of recording. The use of (a) what can be defined 
as '"Block Of rime** recording, were time intervals are predefined for the IntBraction 
capture and storage unit IS to record all interactions taking place at that particular 
time periods, (b) Screen event driven recording can define the start or end of 
recording based on an event / action made in flie ^plication running on llie agenfs 
desktop which is typical or representative of the start or end of an interaction or of a 
part or interaction which is of interest Non-limiting examples are launching of a new 
customer screen in the CRM ^plication, agent opening a new customer file, or 
inviting next customer in line by clicking on the 'TPJexf' button in tiis queue 
management system application, or whenever a discount of more then $100 is 
entered into a CRM application's designated data field, or whenever a specific screen 
is loaded start recording. Screen activily is captured by scareen capture and 
storage component 17,The screen event c^turing agent action is fully described in 
co-pending PCT patent applicatioH serial number PCTyTL02/00197 titled A. 
MEIHOD FOR CAITUOEUNG, . ANALYZING AND RECORDING THE 
CUSTOMER SERVICE REPRESENTATIVE ACIIVITIBS filed 12 March, 2002, 
and in PCT patent application serial number PCT/IL02/00796 titled SYSTEM AND 
METHOD FOR CAPTURING BROWSER SESSIONS AND USER ACHONS 
filed 24 August, 2001 both are incorporated herein by reference. Furthermore, by 
correlating the screen events with voice content analysis one can reach a higher level 
of accuracy for example by idraatifying the end of the interaction by the agent saying 
"next" and at a near time closing ttie customer's file in the CRM application, (c) 
Selective recording based on real time video content analysis is anodier solution &>r 

10 
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detBimining start and stop sessions as well as the complete identification of Ihe 
parties interacted. An example of using face recognition algorithm is e)q)Iained in 
detail in VIDEO AND AUDIO CONIENT ANALYSIS SYSTEM, vMoh is 
incorporated herein by referencei detailed of application stated below. Algorithm 
5 nnming for example on NICE propriety hardware / firmware DSP's based boards or 
on (OTS) Off-The-Shelf board uploaded with known in Ihe art other fece recognition 
algoritiinis. As motioned earlier the video, agent screen and voice are time 
synchronized and as soch the start and end of interaction is deterministic. Frame 
presence detection defines a video fiame to trigger recording whenever a person is 

10 detected (co-esdst) fin: more then x seconds^ when video frame enqsty then stop 
recording (similar to energy level detedidn in Voice recording). Frame xsoxaent 
manipulations are inherent in NICE VISION Product of NICE Systems Ltd. 
Example of capabilities of object/ people video content-based detection can be fbnnd 
in co-pendmg US provisional patent. q>plication scsial nmnber 60/354^09 titled 

15 ALARM SYSTEM BASED ON VIDEO ANALYSIS, filed 6 Februaq^ 2002 which 
is incorporated herein by reference. As mentioned flie video signal cgrptoring & 
storing coirponent 20 recording is triggered selectively nsing face recogniticm for 
example recording pre-defined customers such as VIP customers, or only customers 
that their pictures are already stored in organization database 19 any type of 

20 recording (total / selective) according to the service provider preferences. Preferably 
any pre-detranine content of video can be used to identify start^stop the recording of 
fiontal interaction. Coverage of video content ana^is is described in details in co- 
pending US patent application titled: VIDEO AND AUDIO CONTENT ANALYSIS 
SYSTEM, serial number 10/056,049 dated January 30, 2001 stating Ihe real-time 

25 capabilities based on video content analysis done using Digital Signal Processing 
(DSP/s) which is incorporated herein by reference, (d) The use of ROD (Record On 
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Demand) is another solution for deterroining, or in this particular case manually 
controlling the start and end of interactionA'ecording. Wilh ROD the agent can start 
and stop recording according based on his needs; For example wbenevra a deal is 
takmg place he vdll record it for compliance needs, but he will not record when fte 
customer only came to ask a question. The actual trigger of iho recording can eith^ 
be perfonned by a pineal switdi connecting and disconnectrng the microphones 
from fte capture device oc by b, soitware ^plication running on the agent's 
computadon device, (e) Total Recording is a straightforward solution to mean« 
record and store all calls during woridng houis of tiie service center, pre&rably if 
worlc force management system exist on site it can be integrated as to provide all 
agent* s working periods and brake NICE SYSTEMS Ltd. integradon widi Bhie 
Pumpkin Soflwaic Inc. of Sunnyvale California is a non-limiting example of using 
woridng hours information to calibrate scheduled based recordiQg. (f) API Level 
integration with host applications in the computing system is anotb^ example of 
providing control c^abilities on xvh&i start and end recording is set Several 
capabilities can be achieved setting start and stop API commands, setting routing 
calls command and tiie lik^ NQn-limiting example is the provider of CRM, Siebe] 
Systems, Inc. of San Mateo, California, certified Integration witfa NICE SYSTEMS 
(hat consequentiy provided recording capabilities embedded within Siebel's 
Customer' Relationship Management solution applications. Using ActiveX 
components or other means of conmiand delivery, information can be inserted into 
the scripts of any host application the agent uses it m order that ^en he begins 
handling tiie customs tiie recording is started and when the handling ends it is 
stopped, (g) Integration with Queue Management S^^stems is a geaiuine solution for 
triggedi^ and automatically controlling the start and stop recordmg. Queue 
management systems commonly control the flow of customer through waDc-in 
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environments. By integrating with such systems one can know when a new customer 
is assigned to an agent and the agenf s position. Hence, by integrating with the queue 
management system we can undearstand when the mteraction begins and if next one 
in queue deduces that previous interaction has ended. By deducting Ihis we can 

3 trigger start and stop reoMdmg based on the status the queue management system 
holds for the agent An eacample of a Queue Management System would be sohitions 
(hardware and software) by Q-MATIC Corporation of Neongatan 8 S-43153 
MoMdal, Sweden. It wffl be evident to the person skifled in the art that any 
oombinalian of fhe.above options (a) to (g) is contemplated by the present invention. 

10 In addition, recording of sitence can be avoided using either VOX activity 

detection for detomine microphones activity or by using, later discussed in detail, 
video content to detect customer present in ttic (ROI) Region Of Interest covered by 
camera or either using screen and computer information to determine agent activity 
for example whether agent the is logged of^ and flie like scenarios. The different 

15 algorilhms are parts of the respective components 17, 18, 20 constituting the 
mteraction capture and storage units 15. Agents can also avoid recording if they tmm 
off their nucrophones yibesi ihey are not working. 

Deternunrng the begtaning and Old oflfao hjteractiott vm desor^ 
20 in the previous paragr^ Now to foe second obstacle, namely 4e problem of 
generating foe metadata .for describing foe mtearactiona entry in foe database of 
recorded interactions, for foe purpose of enabling fest query on foe location of a 
sperafic interaction as well as to drive recording or interaction flagging decisions and 
for ferfo« analysis purposes. Metadata «)lIecfion is one of foe major challenges in 
25 Walk-in fiice-fece-fece recording eavhxmmenb characterized by foe ladc of foe CTI 
or CDR/SMDR feed. This is limiting not oafy because it is needed to separate foe 

13 
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interactions, previously discussed, but also because the data describing the call is 
required for other uses. This data, referred to as metadata can include the agents 
name or specific ID, &e customei* name or specific ID, an account number^ the 
department or service the interaction is related to, various flags such as if a 
5 transaction was completed in fiie interaction or if the case has been closed, in 
addition to the begixming and end time of tfie interacdon. This is file type of 
information one usually receives fiom the CTE link in telqihony c^iiric interaction 
but it is not available in this kind of fcontai interaction based environment due to fiie 
fact that an interaction^abhng platform, such as telephony switch, is not required. 

10 As mentioned &e metadata is typical^ used for defining the beginning and end of 
the interacdon. It is also used for providing additional data to describe tfie 
interactions entry in the database of recorded interactions to enable fast location of a 
specific interaction. And, finally to drive recording decisions and flagging of 
intem^tions based on this data. An example for recording decisions are random or 

15 rule-based selection of inteiactions to be recorded or flagged for the purposes of 
quality management A typical selection role could be two interactions per agent per 
week, or one custom^ service interaction and one sales interaction per agent per day 
and one mteraction per visiting customer p^ month. As the start and end of 
mteraction was described in detail in the previous pamgraph, the remaining me^ata 

20 gathering of interaction's related information is accomplishBd using the following 
mefoods. (a) By loggmg tiie agent network login for exanrple Novell or Microsoft 
logut or supplying the agent an application to log-into die system, it is possible to 
ascertain which agent is using the specific position recorded oa a specific chaxmel 
and thus associate the agent name with the reccsding. (b) Again, as before capturing 

25 data on the agent's scareen or fiom an application running on the computing device, 
either by integrating API commands and controls into the scripts of the application or 
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by using scree^x analysis as shown in PCX co-pending patent application serial 
nximber PCT/IL02/00197 titled A METHOD FOR CAPTURING, ANALYZING 
Am RECORDING THB CUSTOMER SERVICE REPRESENTATIVE 
ACnvmES filed March 12, 2002 and in POT co-pending patent application serial 
5 number PCT/IL02/00796 titled SYSTEM AND METHOD FOR CAPTURING 
BROWSER SESSIONS AND USER ACllONS filed August 24, 2001 both are 
incoiposrated herein by rrference. When provided ia real time this can be used for 
real-time triggering of recording based on fte data provided but more important it 
may be used to extract metadata fiom an existing applicalion and store .it in tiie 

10 database conq)onent 19. (c) By adding a DTMF generator and a keypad to fhe 
microphone mixer and/or an^Iifi^ enabling liie agent or customer, to key-in 
information to be associated w&k the call such as customs ID or commands such as 
start or stop recording and the like. The DTMP detection function, which is a known 
in tfie art algorithm and typically exists in digital voice loggia is then used for 

15 recognizing the DTMF dig^ generated command or data and then the command is 
either executed or da& is stored and related to &e recording as metadata. 

In addition, the system may be coupled and share resources with a traditional 
telephony environmrat recording and quality management solution for example: 

20 NiceLog, NiceCLS and NiceUnivctse by NECB Systans Ltd, of Raanana, Israel. In 
such an implantation where two recording solutions co-exists part of &e recording 
resources for voice and screen are allocated for recording of phone lines part for 
frontal &c6-to-&ce cs^turing device recording and events and* additional information 
for these lines, are gathered through CTT integration. Id such an environmeait one can 

25 then recreate all interactions related to a specific data element such as all interactions 
both phone and irontal of a specific custcmier. This can include, for example, the 
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check-in and checkout of a hotel guest in conjunction with his calls to the room 
service line. 

Due to the feet 'fliat frontal interaction may take place in environments with 
relativeiy high levels of noise there is a need to address Hho issue of audio quality and 
to provide improvement of the audio quality. In some environments simply using a 
multi-^ihectional raicaophone will be sufficimt However, in environmeots with 
significant levels of ambient noise and interferences from neighboring positions a 
solution must be given to enable a reasonable level of understandability of the 
recorded voice. Solutions can be divided into ftree kinds: (1) Sohdions external to 
the capture and recording i^paiatus, these Idbid of sohrtiQns include solutions for 
ambiraC noise reduction that are known in Ifae art and use specialized micraphones or 
microphone arrays with noise canceling functions. (2) Solutions wittun fte. capture 
and recording apparatus^ which include noise reduction fimctions, performed in the 
capture and logging platform either during playbadc or during preprocessing of the 
input signal as shown in co-pending PCX patent application serial number 
PCr/lL()2/00593 titled METHOD, APPARATUS AND SYSTEM FOR 
CAPTURING AND ANALYZING INTERACTION BASED CONTENT 
tiled July 18, 2002 incorporated herein by refisrence. Fur&eraiicnre^ as part of the 
audio classification process in the preprocessing sta^ described in detailed in this ' 
co-pending PCT pateait qiplication figure 4> filtering of background elemeats such as 
music, keyboards clidss and the like is discussed. (3) Another solution uses halt (1) 
and (2) solutions fix)m above - &e external and the internal noise reductioa It ofiEers 
a split betwem capture and recording apparatus and the oivironment external to this 
apparatus. This wculd include any combmadon of sohxtions presented in (1) and (2) 
for example a solution in which two directional microphones are pointed towards the 
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customer and agent respectively, their signal enter the capture and logging platform 
where the sound commoB to both is detected and negated from both signals. Then 
both signals are mixed and recorded. They can also remain separated and be mixed 
only i^on recreation of ttie voice - playback. Another example of a solution like tiiis 
5 is one in wbidi the two aiia:ophones are niixed/summed electronically using an 
electronic audio mixer and enter ib& capture and logging platform. In addition, an 
ambient signal is received by an additional muM-directional microphone located in 
the environment and enters Hie capture and logging platform. In the capture & 
log^g platform llie ambient noise is negated from the mixed a^nt/customer signal 
10 before recording or during playback 

La some instances it is beneficial to record video in the walk-in environment 
non- limiting examples of the advantages of using synchronized video recording on 
site were mentioned before as part of the solutions for determining start and end of 

15 interaction and for visually identifying of parties. In cases in which a single video 
camera is positioned to record each service position fte implementation of playback 
is straightforward, ie, playing back the video stream recorded at the same time or 
with a certain fixed bias fioEii the period defined as the beginning and end of tiie 
service interactions, determined as previously (Ksciissed in **frame presence 

20 detection". Other optional implementation instances would incbde an 
implementation in which two cameras are used p^ position, directed at the agent and 
customer, respectively. In this case at the point of replay the user can determine 
which video stream should be replayed or alternatively, have both play in a split 
screen. Another implementation instance would be an environment in which a strict 

25 one-to-one or many-to-one relationship between cameras and positions does not 
exist In sudti an environment the users playing back the recording selects which 
video source is played back with the voice and optionally screen recording. It should 
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be noted ftat the video and voice are synchronized by continuously synchronizing 
the clocks of video captoe & storage system with the Voice and Screen capture 
platfonn using any time synchronization method non limiting example are NTP 
NetwoA Time Protocol, IRIG-B or the like. In cases where one lacks cam^ per 
5 position, one camera can be redirected to an active station based on interaction 
presence indication. Meaning that in scenarios whete fewer cameras than positions 
exist the camera can be adaptively redirected (using camera PTZ - Pan, Til^ Zoom) 
to the active position. Note that cameras can be remotely conttolled, same as in the 
case of multhnedia remote recording vicinities. 

10 

The systems described above can operate in conjunction with all other 
elements and product apphcable to traditional voice recording and quality 
management solution such as remote playbadk and monitoring c^abilities non- 
' limiting examples of such products are Scecutive Connect by NICE Systems Ltd. of 

15 Raanana, Israel Agent eLeamiag solutions - such as KnowDev by Knowlagent Inc, 
Alpharetta, GA. This invention method and system is advantageous over existing 
sohjtions in the sense that it provides a solution for quality management of frontal 
&ce-to-&ce service environments. This enables companies to enhance their quality 
and get more information on their customer's satisfection and to propose quality 

20 management solutions to cover its branches, offering the diverse type of traditional 
recording solutions whether it is total, selective, ROD> screen event triggered 
recording and the like for frontal service environments, executive tools to enable 
remote access to monitor and listen to interaction in frie frontal service environments 
and when couple this solution with traditional telephony solution, yield fiiU coverage 

25 on customer e?q)erieQC6 for better analysis. 
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The person skilled in the art will appreciate that what has been shown is 
not limited to the description above. The person skilled in the art will appreciate that 
ejcamples shown here above are in no way limiting and serve to better and 
adequately describe 4e present invention. Those skilled in the art to which this 

5 invention pertains will t^jpreciate the many modifications and other embodiments of 
fee imfention. It vnH be apparent that the present inyention is not limited to the 
specific enabodimeols disclosed and those modifications and other embodiments are 
intended to be included witbih the scope of the mvention. Although specific terms 
are employed herein. Hiey are used in a genwic and desoqjtive sense onfy and not 

10 for pmposes of limitation. Persons skiUed In the art will appreciate fliat the present 
invention is not limited to what has been particularly shown and described 
hereinabove. Rafter the scq>e of the present taveaition is defined only by the claims, 
which follow. 
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CLAIMS 

I/We daim; 



1. An apparatus for c^turmg, storing and retrieving fece-to-face 
interaotioas in walk*in environments for the ptirpose of further analysis, 
flie apparatus comprisiag: a device for capturing and storing at least one 
&ce to &ce mteraodon cqrtured in the presence of lha parties to &e 
int^nction and a database fi>r storing data and metadata uiJSsmiation 
associated ^vifh the &ce-to-fice mteraction captured. 

2. The apparatus of claim 1 herein Ihe device for c^turing the at least 
one &ce to &oe interaction inchides a voice capture and storage 
component 

3. The ^aratus of claim I wherein the device for c^turfng flie at least 
one face to &ce intesacticm includes a screen capture and storage 
con^onent 

'4. The apparatus of daim 1 wheidn the device for capturing the at least 
one &ce to &ce interaction includes a video capture and storage 
component 

5. The apparatus of claim 1 vAi&tdn Ihe device for c^turing the at least 
one &C6 to face interaction includes a database for recording data, 
metadata associated with die said interaction. 
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6. The apparatus of claims 1-5 wherein the device for capturing flie at least 
one fece to fece intoactionL comprises of separated capture atid storage 
components int^onnected using local area or wide area or wireless or 
an IP-*based networks. 

7. The apparatus of claims 1-6 wherein the interaotibn is a man-tormachine 
interaction in which content is passed or exchanged. 

8. Hie apparatus of claims 1-6 wherein the at least one face to face 
interaction comprises any one of the following: microphone recorded 
audio interaction, video interaction, or computation device screen 
interactioiL 

9. The apparatus of claim 1 wherein the metadata infomoiation is 
information related to the face-to-face interaction wherein each 
interaction has associated metadata. 

10. The apparatus of claim 1 wherein the metadata associated witib the face- 
to-&c6 interaction is ga&iered where the interaction is not enabled by a 
telephony or a messaging platform. 

11. The apparatus of claim 1 further comprises a telephony recording or a 
quality mana^ment device. 
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12. A method for metadata gathering ia waEk-in enviromnaats, the method 
comprising: 

detemimmg the beginning and ending of an interaction assodated 
with a fece-to-face interaction; 

enabling fest location of specific interactions or to derive 
recording decisions; or for 

flagging of interactions based on said data. 

13. TTie method of clann 12 fur&er comprises the step of integrating a queue 
management ^stem wherein the queue management system provides 
data used to trigger recording of frontal fece-to-face interaction or is 
used as metadata describing fiie said interactioiL 

14. The method of claim 12 further comprises the step of mtegrating a work 
force management system wherein the data from work force 
management system is used to trigger recording of frontal face-to-face 
interaction or is used as metadata describing the said iitteractioa 

15. The method of clahn 12 finther comprises the step of capturing screen , 
' events of an agent action or data entering wherein the agent action or the 

data entering is used to trigger recording of frontal face-to-face 
interaction or is used as metadata describing tiie said interaction. 

16. The mefiiod of claim 12 further comprises the step of time 
synchronization and content analysis of at least two of video , voice and 
screca wherein the analysis results triggers or is used to trigger 
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recording of firontal &£e-to-&ce interaction or is used as metadata 
deseribing the said ixiteiaction 

17. Tbe method of claim 12 forfher comprises the step of integrating a host 
5 computer application wherein the appKcation serves as the trigger for 

recordmg of frontal face-to-fece interaction or is used as metadata 
describing the said interaction. 

18. The method of claim 12 wherein the metadata associated with the face-* 
10 to-face interaction is gathered \^iiere the mteraction is not enabled by a 

telephon3'or a messagmg platforms. 
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